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Abstract.  We  propose  a  biochemically  plausible  mechanism  for  constructing 
digital  logic  signals  and  gates  of  significant  complexity  within  living  cells.  These 
mechanisms  rely  largely  on  co-opting  existing  biochemical  machinery  and  bind¬ 
ing  proteins  found  naturally  within  the  cell,  replacing  difficult  protein  engineering 
problems  with  more  straightforward  engineering  of  novel  combinations  of  gene 
control  sequences  and  gene  coding  regions. 

The  resulting  logic  technology,  although  slow,  allows  us  to  engineer  the  chemical 
behavior  of  cells  for  use  as  sensors  and  effectors.  One  promising  use  of  such 
technology  is  the  control  of  fabrication  processes  at  the  molecular  scale. 


1  Introduction 

Cells  provide  an  isolated,  controlled  environment  for  carrying  out  complex  chemical 
reactions.  Moreover,  they  reproduce  themselves,  allowing  the  creation  of  many  copies 
with  little  manufacturing  effort.  The  ability  to  control  cellular  function  will  provide 
important  capabilities  in  computation,  materials  manufacturing,  sensing,  effecting,  and 
fabrication  at  the  molecular  scale. 

This  work  is  part  of  an  effort  to  learn  how  to  control  the  chemical  mechanisms 
of  the  cell,  by  co-opting  the  existing  biological  mechanism  and  by  constructing  novel 
mechanism.  One  particular  short-term  goal  is  to  engineer  chemical  mechanisms  which 
can  be  used  to  implement  the  digital  abstraction — the  notion  that  chemical  signals  can 
represent  logical  true  and  false  (or  zero  and  one)  values. 

Like  any  good  abstraction,  the  digital  abstraction  allows  us  to  ignore  the  fine  details 
of  a  complex  phenomenon,  and  concentrate  on  the  essentials  of  the  control  process. 

The  essential  features  of  any  digital  logic  implementation  include  the  ability  to  dis¬ 
tinguish  and  maintain  two  distinct  values  of  some  physical  representation  of  a  signal. 
This  requires  the  presence  of  adequate  noise  margins — an  ability  to  produce  outputs 
whose  physical  values  more  perfectly  represent  a  given  logical  value  than  the  physical 
representation  of  their  input.  Adequate  noise  margins  allow  noise  and  imperfections 
in  a  digital  system  to  be  reduced,  rather  than  amplified,  during  complex  information 
processing. 

This  work  attempts  to  define  a  series  of  biologically  plausible  chemical  reactions 
which  can  implement  such  a  digital  abstraction. 
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2  Building  on  Available  Biological  Mechanisms 

Our  strategy  builds  upon  existing  biological  mechanisms.  We  are  fortunate  that  the 
natural  world  exhibits  mechanisms  similar  to  those  a  good  engineer  would  design.  By 
using  a  modified  version  of  these  naturally  occurring  mechanisms  we  avoid  potentially 
very  challenging  issues  of  engineering  complex  protein-DNA  interactions,  allowing  us 
to  build  interesting  structures  with  a  mix  and  match  approach,  combined  with  limited 
modification. 

Here  we  show  the  feasibility  of  building  a  family  of  logic  gates  where  the  signals  are 
represented  by  concentrations  of  naturally-occurring  DNA-binding  proteins,  and  where 
the  nonlinear  amplification  is  implemented  by  in  vivo  DNA-directed  protein  synthesis. 
We  first  review  the  essential  aspects  of  protein  manufacturing  in  the  cell. 

2.1  Making  Proteins  within  a  Cell 

Proteins  are  ordered  molecular  polymers  of  50-1000  amino  acids,  of  20  different  types. 
Each  of  the  approximately  500- 10,000  protein  types  in  a  typical  cell  consists  of  a  unique 
sequence  of  amino  acids.  Moreover,  each  protein  chain  folds  into  a  characteristic  three- 
dimensional  structure,  which  is  necessary  for  its  activity. 

Many  proteins,  called  enzymes,  act  as  exquisitely  selective  catalysts  for  specific 
chemical  reactions,  allowing  these  reactions  to  take  place  dramatically  faster  than  they 
would  under  normal  circumstances.  The  presence  or  absence  of  an  enzyme  effectively 
switches  reactions  on  and  off  within  a  cell. 

The  amino  acid  sequence  (and  thus  the  properties)  of  a  particular  protein  is  con¬ 
trolled  by  the  sequence  of  DNA  codons  in  the  associated  gene.  Triplets  of  the  four 
DNA  nucleotides.  A,  T,  G,  and  C  specify  one  of  64  code  words.  These  64  code  words 
specify  a  start  code,  three  stop  codes,  and  a  redundant  specification  of  which  of  amino 
acids  should  be  inserted  next  into  a  partially  constructed  protein  molecule. 

The  information  in  the  DNA  is  not  directly  used  to  manufacture  protein.  Instead,  the 
primary  DNA  sequence  for  a  gene  is  first  copied,  in  a  process  called  transcription,  into 
an  intermediate  form  of  RNA,  called  messenger  RNA  (mRNA).  This  copying  process  is 
under  the  control  of  an  enzyme  complex  called  RNA  polymerase.  The  copying  process 
is  not  automatic,  and  control  is  carefully  exercised  over  which  portions  of  the  DNA  are 
copied  into  mRNA.  We  will  use  these  control  mechanisms  as  the  basis  for  our  logic 
gates. 

The  mRNA  transcript  is  then  (often  in  a  pipelined  manner)  used  by  an  enzyme/RNA 
complex  called  the  ribosome  to  manufacture  proteins.  This  process  of  manufacturing 
proteins  from  mRNA  transcripts  is  called  translation.  The  protein  manufacturing  pro¬ 
cess  is  sometimes  used  as  a  control  mechanism,  but  is  far  less  widely  used  than  control 
of  mRNA  synthesis.  mRNA  is  degraded  quite  rapidly  by  the  cell.  The  rate  of  mRNA 
degradation  is  sequence  dependent.  Thus  protein  synthesis  requires  continual  replen¬ 
ishment  by  the  creation  of  new  mRNA  copies  by  RNA  polymerase  from  the  primary 
DNA  gene. 

Proteins  are  also  gradually  degraded  within  the  cell,  at  a  sequence  dependent  rate. 
The  continuing  presence  of  a  particular  protein  thus  depends  upon  its  creation  by  the 
translation  of  mRNA  transcripts. 
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2.2  Control  of  Gene  Expression 

The  creation  of  mRNA  transcripts  is  carefully  controlled  within  the  cell.  Each  DNA 
coding  sequence  (gene)  is  accompanied  by  an  upstream  control  region,  consisting  of 
non-coding  DNA  sequences.  Some  of  these  sequences  signal  the  binding  location  for 
RNA  polymerase,  the  enzyme  which  catalyzes  the  creation  of  mRNA.  Other  sequences 
are  the  binding  sites  for  either  repressors  or  promoters,  which  are  proteins  that  selec¬ 
tively  bind  to  specific  DNA  sequences  within  the  control  region. 

Repressor  binding  sites  typically  overlap  the  RNA  polymerase  binding  site — a  pro¬ 
tein  bound  to  this  site  physically  interferes  with  the  binding  of  RNA  polymerase.  Pro¬ 
moter  binding  sites  are  typically  located  some  distance  from  the  RNA  polymerase  bind¬ 
ing  site,  and  the  binding  of  a  promoter  to  such  a  site  makes  it  easier  for  RNA  polymerase 
to  bind  and  initiate  mRNA  production. 

In  many  cases,  a  single  gene-control  sequence  contains  several  promoter  and  re¬ 
pressor  regions.  In  this  proposal,  we  make  use  of  only  repressor  DNA  binding  proteins. 

2.3  Protein  Dimers  and  Cooperative  Binding 

An  essential  aspect  of  any  digital  logic  gate  is  a  high-quality  nonlinearity.  In  essence,  a 
digital  gate  must  exhibit  low  gain  for  signals  near  a  logical  zero  or  one,  while  exhibiting 
high  gain  for  signals  within  the  transition  regions. 

The  biological  world  utilizes  two  chemical  techniques  to  achieve  this  highly  non¬ 
linear  behavior. 

The  first  technique  is  the  use  of  protein  dimers  as  the  biologically  active  form.  The 
active  form  of  many  enzymes,  including  the  DNA  binding  proteins  we  propose  using,  is 
a  bound  combination  of  two  copies  of  the  protein.  In  equilibrium,  the  concentration  of 
the  dimer  is  proportional  to  the  square  of  the  protein  concentration.  Higher  power  non- 
linearities  can  be  achieved  with  tetramers,  hexamers,  or  even  higher  multimers.  Such 
multimers  are  common  in  biologically  active  protein  complexes. 

A  similar  power  law  behavior  is  obtained  in  cooperative  binding  of  proteins  to  a 
substrate.  Cooperative  binding  refers  to  mechanisms  in  which  the  first  of  several  protein 
binding  reactions  occurs  at  a  relatively  low  rate,  while  subsequent  binding  reactions 
occur  more  rapidly,  because  the  presence  of  the  already  bound  protein  enhances  the 
binding  affinity. 

2.4  Related  Work 

There  has  been  considerable  work  on  the  analysis  of  naturally  occurring  genetic  regu¬ 
latory  networks,  however  we  believe  that  this  is  the  first  attempt  to  develop  a  synthetic 
design  discipline  for  the  construction  of  novel  genetic  regulatory  networks  based  on  a 
digital  abstraction. 


Of  course,  this  work  builds  on  the  wonderful  analytic  work  of  generations  of  biol¬ 
ogists.  It  is  directly  inspired  by  our  understanding  of  lambda  phage  lysogeny.  This  is  a 
naturally  occurring  genetic  switch  that  controls  the  lytic/lysogenous  phase  in  bacterial 
cells  infected  with  the  A  phage.  Ptashne’s  classic  book  on  the  subject  is  required  reading 
to  instill  modesty  among  those  who  would  engineer  these  systems. 

While  the  details  of  the  switch  mechanism  are  (typically)  substantially  more  com¬ 
plex  than  the  techniques  proposed  here,  it  is  striking  that  both  the  CRO  and  A  repressor 
are  dimeric  protein  complexes,  and  that  both  interact  cooperatively  with  gene  control 
sites. 

Serious  simulation  studies  by  Harley  McAdams,  Matthew  Scott,  and  Adam  Arkin 
have  substantially  advanced  our  knowledge  of  the  lambda  phage  switch  and  of  other 
naturally  occurring  genetic  regulatory  networks.  In  particular,  McAdams  has  shown 
that  the  noise  associated  with  the  discrete  nature  of  the  chemical  process  plays  an  im¬ 
portant  role  in  providing  population  diversity  that  may  enhance  survival.  Of  course,  as 
engineers  we  need  to  be  able  to  arrange  that  our  systems  are  reliable  in  the  face  of  such 
noise. 

An  important  qualitative  understanding  of  the  nature  of  genetic  regulatory  networks 
in  the  context  of  evolution  and  development  was  developed  by  Stuart  Kauffman. 


3  A  Biologically  Plausible  Gate 

We  can  use  the  naturally  occurring  mechanisms  of  controlled  mRNA  transcription,  re¬ 
pressors,  cooperative  binding,  and  the  degradation  of  mRNA  and  proteins  as  a  way  to 
implement  a  logical  inverter. 

The  “signals”  in  our  logic  system  consist  of  concentrations  of  specific  DNA  binding 
proteins,  which  act  as  repressors.  These  concentrations  can  be  thought  of  as  a  simple 
integer  count  of  how  many  protein  molecules  of  a  particular  type  exist  within  a  single 
cell. 

The  “inverters”  in  our  logic  system  consist  of  genes — specific  DNA  coding  regions, 
along  with  their  control  sequences — which  code  for  the  production  of  specific  proteins. 
Normally,  the  coded  proteins  are  themselves  DNA  binding  proteins,  which  are  used  as 
inputs  to  other  such  inverters.  They  could,  of  course,  code  for  enzymes  which  effect 
some  other  action  within  the  cell,  such  as  motion,  illumination,  or  chemical  reactions. 
Similarly,  the  input  of  our  gates  could  consist,  not  of  the  output  of  another  logic  gate, 
but  of  a  sensor  which  creates  a  DNA  binding  protein  in  response  to  illumination,  a 
chemical  in  the  environment,  or  the  concentration  of  specific  intracellular  chemicals. 

One  additional  feature,  also  present  in  some  naturally  occurring  transcription  con¬ 
trol  mechanisms,  is  used  in  our  gate  to  control  the  output  level  of  the  DNA  binding 
protein  product.  Specifically,  the  presence  of  large  concentrations  of  DNA  binding  pro¬ 
tein  produced  by  a  particular  gene  is  used  to  inhibit  the  transcription  of  more  copies  of 
its  mRNA.  This  results  in  a  predictable  amount  of  the  gene  product  when  the  gene  is 
turned  on. 

Given  a  way  of  constructing  inverters — nonlinear  inverting  amplifiers  with  suffi¬ 
cient  gain  and  nonlinearity  to  obtain  the  required  noise  margins — it  is  easy  to  construct 


any  logical  circuit.  Multi-input  logic  functions  are  performed  in  our  model  by  con¬ 
structing  multiple  inverters  (genes),  each  having  the  same  binding  protein  as  a  product, 
but  with  different  control  inputs.  In  many  respects,  this  is  similar  to  the  I^L  integrated 
circuit  logic  family. 

4  The  Chemical  Model  Reactions  and  their  Kinetic  Equations 

In  this  section,  we  consider  the  detailed  chemical  reactions  of  a  single  inverter.  A  de¬ 
tailed  understanding  of  the  static  and  dynamic  behavior  of  the  inverter  is  the  key  to 
digital  gate  design.  More  complex  gates  are  simple  extensions  of  this  understanding. 
In  the  case  of  our  proposal,  complex  gates  are  formed  simply  by  constructing  inverters 
with  distinctive  control  sequences,  but  which  produce  the  identical  gene  products. 

Consider  the  inverter  with  DNA  binding  protein  input  A  and  DNA  binding  pro¬ 
tein  output  B.  B  is  manufactured  by  ribosomes  acting  on  mRNA  copied  from  a  DNA 
coded  gene,  Bq  ■  B  is  destroyed  by  scavenging  mechanisms  which  continually  degrade 
cellular  protein. 

The  rate  of  production  of  the  mRNA  transcripts  is  controlled  by  promoter  and  re¬ 
pressor  regions  upstream  from  the  coding  region  of  the  gene  for  B,  Bq.  In  this  example, 
we  will  assume  that  both  A  and  B  act  as  repressors  for  the  transcription  of  the  Bq. 

We  will  model  the  binding  of  the  DNA  binding  proteins  A  and  B  as  reversible 
chemical  reactions,  transforming  the  gene  Bq  into  a  repressed  (inactive)  form  BqA 
and  BqB  respectively.  Note  that  we  require  n  molecules  of  A  or  B  to  inactivate  the 
gene — this  is  the  source  of  the  needed  nonlinearity.  We  will  obtain  excellent  inverter 
behavior  with  n  =  4. 


Bq  +  nA 

-^^BqA 

binding 

Bq  +  nB 

BqB 

binding 

BqA 

- - - 5-  Bq  +  nA 

dissociation 

BqB 

- - - 5-  Bq  +  nB 

dissociation 


Only  the  active  form,  Bq  will  be  transcribed  and  translated  to  form  B  in  the  essen¬ 
tially  irreversible  reaction 

Bq  - ^ ^  Bq  +  B. 

RNA  polymerase,  ribosome 

The  breakdown  of  A  and  B  is  modelled  by  the  reactions 


breakdown 

B  - - - 5-  . 


breakdown 


The  kinetic  equations  for  this  chemical  mechanism  can  be  written  as  a  set  of  cou¬ 
pled  differential  equations  corresponding  to  the  conservation  of  material.  We  added  a 
“drive”  term,  which  is  not  part  of  the  chemistry,  to  allow  us  to  manipulate  the  system 
numerically,  by  setting  a  schedule  for  production  of  the  protein  A.  This  would  not  be 
present  in  any  biological  implementation,  but  then  the  input  variable  [A]  would  be  cou¬ 
pled  to  the  rest  of  the  system  by  being  the  output  variable  of  some  other  gate  or  sensor. 


+  kalBoA]  +  k4BGB]  (1) 

=  kMnBG]  -  ko^BGA]  (2) 

=  k2[BnBG]  -  k^BGB]  (3) 

^  =  k4BG]+nk4BGB]-nk2[B]4BG]-k4B]  (4) 

=  drive  -b  nkz[BGA]  -  k^A]  -  (5) 


The  bound  and  unbound  copies  of  the  B  gene  are  conserved,  and  equal  to  the  gene 
copy  number  in  the  cell,  so  we  may  define  [Bq]  as  the  concentration  of  any  form  of  Bq: 


[Bo\  =  [Bg]  +  [BgA]  +  [BgB]  .  (from  1,2,3)  (6) 


In  equilibrium,  the  time  derivatives  of  concentrations  are  zero,  giving  us  the  equa¬ 
tions: 


[BgA] 

[BgB] 

[Bg] 


k^Bo]  +nk4[BGB] 
drive  +  nkz  [BgA\ 


^[AnBo]  (7) 

^[S]”[Sg]  (8) 

[Bo]  -  [BgA]  -  [BgB] 

[Bo]  -  |[^]”[Sg]  -  Y^mBa] 


l  +  f^[^]”  +  |j[B]" 

(9) 

nA:2[B]”[BG]+A:7[B] 

(10) 

k^A]  +  nki[A]4BG] 

(11) 

Simplifying  further,  we  arrive  at  the  equilibrium  input/output  relationship  between 
[A]  and  [B]: 


[Bg] 

'Hb] 

'5 

[Bo] 

[A]" 

[A]" 


[B] 


[Bi 


'0 


B]  + 


kjki 


Ml  Ml"  mi  +  Mimi"+1 
kzks  kzki 

-  fe[B] 


(from  10,  8) 
(from  12,  9) 


Letting  ct  =  l3  =  ^  and  7  =  we  arrive  at  the  equation 


(12) 

(13) 

(14) 

(15) 

(16) 


a[A]”  =  7^  -  P[Br  -  1  (17) 

This  relationship  defines  the  input/output  transfer  function  of  our  system.  The  pa¬ 
rameters  a,  /),  and  7  are  ratios  of  kinetic  constants,  and  [Bq]  is  the  conserved  concen¬ 
tration  of  the  gene. 


5  Equilibrium  Behavior 

We  set  n  =  4.  We  have  found  that  this  value  produces  a  good  quality  inverter. 

We  need  to  put  further  constraints  on  our  concentrations  to  make  a  useful  inverter. 
In  particular,  we  want  the  output  of  an  inverter  to  be  a  signal  in  the  same  analog  range 
as  the  input,  and  we  may  require  the  output  to  go  through  the  halfway  point  just  when 
the  input  does.  These  constraints  will  restrict  the  possible  values  of  the  parameters  a, 
l3,  7,  and  [Bq]. 

Let’s  start  by  declaring  that  the  range  of  signals  is  the  interval  [0,1].  So  we  require 
that  [A]  =  0  when  [B]  =  1.  This  is  a  free  choice,  because  it  just  determines  the  units 
that  we  use  to  measure  the  concentrations.  We  can  also  require  that  [A]  =  1/2  when 
[B]  =  1/2.  This  is  an  actual  restriction  on  our  transfer  characteristic.  Plugging  these 
constraints  into  the  equation  (17)  results  in  the  relationships: 


p  =  7[Bo]  -  1  (18) 

a  =  3l7[Bo]  -  15  (19) 

These  leave  us  with  exactly  one  free  parameter  for  the  transfer  characteristics  of  our 
inverter,  7[ilo]-  Do  we  get  a  good  inverter  for  reasonable  values  of  this  parameter?  The 
answer  is  yes. 


Figure  1  shows  that  the  relationship  in  equation  (17)  between  the  input  and  output 
concentrations  yields  the  classic  transfer  curve  of  a  good  digital  inverter.  (Here  n  =  4 
and  ^\Bq\  =  2.5.)  Note,  in  particular,  the  low  gain  for  high  and  low  input  concen¬ 
trations,  separated  by  a  relatively  high-gain  transition  region.  This  nonlinearity  is  the 
essence  of  digital  gates,  and  forms  the  basis  for  effectively  rejecting  small  variations  in 
the  input  signals — that  is,  for  attenuating  the  input  noise. 


Fig.  1.  The  DC  transfer  curve  for  our  mechanism  is  similar  to  the  characteristic  of  an  NMOS 
inverter.  The  abscissa  is  the  the  concentration  of  the  input  protein  [A]  and  the  ordinate  is  the 
concentration  of  the  output  protein  [B].  The  scales  of  both  axes  are  [0, 1]. 


It  is  encouraging  that  the  proposed  inverter  shows  promising  low  sensitivity  to  vari¬ 
ations  in  the  chemical  rate  constants.  The  sets  of  curves  shown  in  figures  2  and  3  display 
the  results  of  halving  and  doubling  of  each  of  the  constants  a,  (i  in  equation  (17),  while 
holding  ^\Bq\  constant.  Of  course,  changing  these  constants  violates  our  constraints 
(equations  (18,19))  but  the  resulting  system  is  still  quite  serviceable. 

What  is  even  more  spectacularly  encouraging  is  that  the  value  of  ^\Bq\  can  be  varied 
over  a  huge  range  without  making  this  inverter  unusable.  In  figure  4  we  see  that  varying 
"i\Bq\  over  the  entire  range  [1, 50]  has  almost  no  effect  on  the  characteristic  shape  of 
our  inverter. 

Of  course,  this  is  still  useless  unless  the  kinetic  constants  for  biologically  feasible 
reactions  are  within  these  ranges.  We  have  not  yet  done  this  analysis,  but  we  hope  to 
have  results  to  report  shortly. 


Fig.  2.  Variation  in  the  parameter  a  affects  the  inverter  threshold  of  the  gate.  Here  we  see  the 
effect  of  a  factor  of  four  variation  in  a.  The  scales  of  both  axes  are  [0, 1]. 


Fig.  3.  Variation  in  the  parameter  /3  affects  the  concentration  representing  a  logic  one.  Here  we 
see  the  effect  of  a  factor  of  four  variation  in  (3.  The  scale  of  the  abscissa  is  [0, 1]  and  the  scale  of 
the  ordinate  is  [0,  2]. 


Fig.  4.  Variation  in  the  parameter  7[Bo]  has  almost  no  effect  on  the  inverter  characteristic.  Here 
we  vary  7[Bo]  over  a  factor  of  fifty.  The  scales  of  both  axes  are  [0, 1]. 


6  Dynamic  Behavior 

Static  transfer  curves  do  not  address  the  dynamic  behavior  of  logic  circuits.  Although 
the  static  transfer  characteristic  of  the  system  was  determined  by  one  free  parameter, 
the  dynamic  characteristics  are  much  more  complicated.  Even  given  the  constraints  on 
the  transfer  characteristic  of  the  last  section  the  dynamics  depends  on  choices  of  7, 
[Bo]^  kz,  ki,  ke,  and  kj.  We  have  not  yet  explored  this  space,  so  we  show  here  the 
behavior  of  only  one  choice.  In  the  simulations  we  have  chosen  the  following  values 
for  the  free  parameters — we  have  not  tried  to  scale  these  for  biologically  plausible  time 
scales  and  concentrations.  But  similar  behavior  can  be  obtained  over  very  wide  ranges 
of  variations  of  these  parameters. 

n  =  4, 7  =  25,  Bo  =  0.1,  ^3  =  0.2,  k4  =  0.03,  fee  =  1.0,  fey  =  1.5 

In  hgure  5  we  show  behavior  of  our  gates  when  stimulated  with  a  abrupt  transition 
(square  wave)  input  signal,  by  turning  on  and  off  production  of  protein  A.  The  concen¬ 
tration  of  A  then  changes,  as  shown  in  the  simulation,  due  to  the  sudden  production 
change,  in  combination  with  the  concentration-dependent  destruction  rate,  and  its  bind¬ 
ing  to  Bq.  The  binding  of  A  to  DNA  inhibits  production  of  B,  entailing  the  changes  in 
the  concentration  of  B,  thus  yielding  the  inverter  behavior  desired. 

Note  that  the  simulation  shown  in  hgure  5  the  inverter  is  not  loaded  by  connection 
to  the  next  stage.  We  must  show  that  the  inverters  work  with  an  attached  load.  One  way 
to  do  this,  which  also  shows  other  important  features  of  the  dynamics,  is  to  hook  three 


Fig.  5.  The  top  trace  shows  the  externally-imposed  drive,  the  rate  of  production  of  protein  A.  The 
middle  trace  shows  the  response  of  [A].  The  bottom  trace  shows  the  response  of  [B].  The  two 
rises  of  [B]  do  not  look  the  same  because  the  simulation  was  started  with  an  arbitrary  initial  state. 
Subsequent  rises  of  [B]  would  look  very  much  like  the  second  one. 


of  these  inverters  up  to  make  a  ring  oscillator.  Simulation  of  this  oscillator  (see  figure  6) 
demonstrates  that  we  actually  have  working  inverting  amplifiers.  In  this  simulation  we 
started  with  a  random  initial  condition.  We  see  that  our  parameters  are  not  particularly 
optimal,  but  the  system  oscillates  with  reasonable  swing. 


7  Implementation  Issues 

Any  new  digital  logic  family  has  important  characteristics  and  limitations  which  must 
be  understood  in  order  to  effectively  design  with  them.  In  this  section,  we  have  at¬ 
tempted  to  predict  some  of  these  issues,  but  inevitably  there  will  be  many  which  we 
will  overlook  until  real  implementations  of  these  gates  is  underway. 

7.1  These  Gates  are  Slow 

Perhaps  the  most  dramatic  difference  between  our  biological  gates  and  conventional 
logic  gates  is  the  difference  in  speed.  Electrical  gates  now  function  with  delays  of 
tens  of  picoseconds.  Biological  gates  constructed  using  this  methodology  will  have  de¬ 
lays  governed  by  the  speed  of  protein  manufacturing — perhaps  many  minutes.  Roughly 
speaking,  we  should  think  of  this  logic  family  as  functioning  at  frequencies  measured 
in  millihertz,  rather  than  at  rates  measured  in  Megahertz. 


Fig.  6.  The  ring  oscillator  oscillates  at  a  rather  high  frequency,  limited  by  the  rise  time,  the  fall 
time,  and  the  storage  time  of  the  inverter. 

While  other  biological  mechanisms  might  be  constructed  which  could  function  (op¬ 
timistically)  at  kilohertz  rates,  such  structures  will  require  a  degree  of  engineering  fi¬ 
nesse  which  we  believe  will  not  be  available  in  the  short  term.  Protein  design,  and,  es¬ 
pecially,  protein-complex  design,  required  for  such  high-speed  gates,  is  not  sufficiently 
well  understood  at  the  present  time.  Indeed,  the  major  requirement  may  be,  ironically, 
better  and  faster  computational  tools. 

7.2  Complexity  Limitations 

A  critical  resource  in  the  design  of  complex  logic  circuits  within  a  cell  is  the  availabil¬ 
ity  of  a  sufficient  number  of  distinct  DNA  binding  proteins.  Not  only  must  many  such 
proteins  be  found,  but  the  set  employed  must  not  be  used  elsewhere  within  the  host  cell 
control  mechanisms.  We  estimate  that  tens  to  hundreds  of  such  proteins  exist  naturally. 
With  luck,  we  can  perhaps  learn  how  to  engineer  many  others  within  this  limited  do¬ 
main.  Another  approach  is  to  control  the  expression  of  genes  that  code  for  antisense 
RNA.  Selective  binding  of  these  can  be  used  to  inhibit  the  production  of  other  gene 
products. 

A  second  limitation  comes  from  the  requirement  of  finite  concentrations  of  these 
proteins  within  the  cell.  Cells  have  finite  volume,  and  the  requirement  that  many  copies 
of  a  protein  exist  to  have  effect,  together  with  the  complexity  requirement  for  many 
distinct  proteins,  leads  to  an  upper  limit  on  the  logic  complexity  which  can  be  performed 
within  a  single  cell.  We  are  confident,  however,  that  logic  circuits  of  an  interesting  level 
of  complexity  can  be  constructed. 


7.3  Cell  Cycle  Coordination 


Cells,  particularly  bacterial  cells,  are  not  static  objects.  They  undergo  a  growth  and 
division  cycle  which  has  a  profound  effect  on  the  biochemistry.  For  example,  the  value 
of  [Bo],  the  total  number  of  copies  of  the  gate  gene  within  the  cell,  varies  over  time 
because  the  DNA  is  being  replicated  prior  to  cell  division.  Gates  must  be  robust  against 
such  potentially  major  changes  in  internal  cell  chemistry. 

7.4  What  Organism? 

The  choice  of  organism  is  critical.  One  view  is  that  we  should  experiment  with  the 
simplest  possible  organism,  perhaps  Mycoplasma  capricolum,  which  has  the  advantage 
of  a  small  genome,  and  is  already  sequenced.  But,  it  is  difficult  to  culture,  and  few 
workers  have  experience  with  it. 

Another  view  espouses  the  standard  biological  prokaryote,  Eschericia  coli.  It,  too 
is  now  sequenced,  although  with  a  genome  twelve  times  as  large.  Extensive  experience 
and  the  wide  availability  of  tools  argue  strongly  for  its  adoption. 

A  third  view  proposes  that  we  use  a  eukaryote,  specifically  the  yeast,  S.  cervisiae, 
as  an  ideal  organism,  because  of  its  richer  genetics,  isolated  nucleus,  and  more  complex 
transcription  mechanisms. 

Our  current  approach  favors  the  use  of  E.  coli  due  to  its  wide  availability  and  over¬ 
whelming  popularity.  Eventually,  the  advantages  of  using  a  dramatically  simplified  cell 
seem  large.  Engineering  such  a  simplified  cell  may  be  an  important  early  goal. 


8  Applications 

Cellular  computing  opens  a  new  frontier  of  engineering  that  will  dominate  the  technol¬ 
ogy  of  the  next  century.  Employing  information  technology,  the  future  holds  promise 
for  the  development  of  means  to  organize  and  control  biological  processes  that  are  just 
as  effective  as  our  current  mastery  of  electrical  processes. 

In  particular,  biological  cells  are  self-reproducing  chemical  factories  that  are  con¬ 
trolled  by  a  program  written  in  the  genetic  code.  Current  progress  in  biology  will  soon 
provide  us  with  an  understanding  of  how  the  code  of  existing  organisms  produces  their 
characteristic  structure  and  behavior.  As  engineers  we  can  take  control  of  this  process 
by  inventing  codes  (and  more  importantly,  by  developing  automated  means  for  aiding 
the  understanding,  construction,  and  debugging  of  such  codes)  to  make  novel  organisms 
with  particular  desired  properties. 

Besides  the  obvious  application  of  control  of  biological  processes  to  medicine,  we 
will  be  able  to  co-opt  biological  processes  to  manufacture  novel  materials  and  structures 
at  a  molecular  scale.  The  biological  world  already  provides  us  with  a  variety  of  useful 
and  effective  mechanisms,  such  as  flagellar  motors.  If  we  could  co-opt  cells  to  build 
organized  arrays  of  such  motors,  with  accessible  interfaces  for  power  and  control,  we 
could  see  how  this  could  be  of  engineering  significance.  Common,  biologically  avail¬ 
able  conjugated  polymers,  such  as  carotene,  can  conduct  electricity,  and  can  be  assem¬ 
bled  into  active  components.  If  we,  as  engineers,  can  acquire  mastery  of  mechanisms  of 


biological  differentiation,  morphogenesis,  and  pattern  formation,  we  can  use  biological 
entities  of  our  own  design  as  construction  agents  for  building  and  maintaining  complex 
ultramicroscopic  electronic  systems.  Such  systems  will  have  better  performance  and 
reliability  then  technologies  based  on  less  precisely  controlled  chemical  processes.  Of 
course,  one  of  the  most  important  products  of  mass-produced  molecular-scale  engineer¬ 
ing  will  be  extremely  compact,  efficient,  and  effective  computing  mechanisms. 

Thus,  in  spite  of  the  long  gate  delays  in  cellular  computing  mechanisms,  the  fact 
that  cells  can  reproduce  and  organize  into  precisely  arranged  and  differentiated  tissues 
means  that  we  can  use  them  as  the  (very  slow)  agents  of  molecular-scale  manufacturing 
of  macroscopic  objects.  It  is  the  resulting  objects  that  we  desire — they  may  contain 
electrical  circuitry  with  picosecond  cycle  times.  The  slow  biological  systems  are  our 
machine  shops,  with  proteins  as  the  machine  tools,  and  with  DNA  as  our  control  tapes. 
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